RAW SEQUENCE LISTING DATE: 07/23/2001 

PATENT" APPLICATION: US/09/900,575 TIME: 13:37:27 



Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRF3\07232001\I900575.raw 



3 <110> 



APPLICANT: Langermann, Solomon 
Revel, Andrew 
Auguste, Christine 
Burlein, Jeanne 



ENTERED 



4 
5 
6 



8 <120> TITLE OF INVENTION: FimH Adhesin Proteins and Methods of Use 
10 <130> FILE REFERENCE: 469201-549 
C--> 12 <140> CURRENT APPLICATION NUMBER: US/09/900,575 
C--> 12 <141> CURRENT FILING DATE: 2001-07-06 

12 <150> PRIOR APPLICATION NUMBER: US/60/216,750 

13 <151> PRIOR FILING DATE: 2000-07-07 
15 <160> NUMBER OF SEQ ID NOS : 64 

17 <170> SOFTWARE: Patentln version 3.0 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 837 

21 <212> TYPE: DNA 

22 <213> ORGANISM: E. coli 

24 <400> SEQUENCE: 1 

25 ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 
27 tatgtaaacc ttgcgcccgt cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 
29 caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 
31 ggctcggctt atggcggcgt gttatctaat ttttccggga tcgtaaaata tagtggcagt 
33 agctatcctt tccctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 
35 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggggg agtggcgatt 
37 aaagcaggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 
39 ggtttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 
41 tgcgatgctt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 
43 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 
45 accgcaggtg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 
47 gtcggcgtac agttggcgcg caacggtacg gttattccag cgaataacac ggtatcgtta 
49 ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattacgc acgtaccgga 
51 gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 

54 <210> SEQ ID NO: 2 

55 <211> LENGTH: 837 

56 <212> TYPE: DNA 

57 <213> ORGANISM: E. coli 

59 <400> SEQUENCE: 2 

60 ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatgtt 
62 tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 
64 caaatctttt gccataacga ttacccggaa accattacag actatgtcac actgcaacga 
66 ggttcggctt atggcggcgt gttatctagt ttttccggga tcgtaaaata taatggcagt 
68 agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 
70 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggggg agtggcgatt 
72 aaagcaggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 
74 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 
76 tgcgatgctt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 
78 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 
80 accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 
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RAW SEQUENCE LISTING DATE: 07/23/2001 

PATENT APPLICATION: US/09/900,575 TIME: 13:37:27 

Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRF3\07232001\I900575.raw 

82 gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 
84 ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 
86 gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 

89 <210> SEQ ID NO: 3 

90 <211> LENGTH: 837 

91 <212> TYPE: DNA 

92 <213> ORGANISM: E. coli 

94 <400> SEQUENCE: 3 

95 ttcgcctgta aaaccgccaa tggtacagct atccctattg gcggtggcag cgctaatgtt 60 
97 tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 
99 caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 
101 ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 
103 agctatccat ttccgaccac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 
105 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggcgatt 360 
107 aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 
109 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 
111 tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540 
113 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 
115 accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660 
117 gtcggcgttc agttgacgcg caacggtacg attattccca cgaataacac ggtatcgtta 720 
119 ggagcagtac ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga ■ 780 
121 gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 

124 <210> SEQ ID NO: 4 

125 <211> LENGTH: 840 

126 <212> TYPE: DNA 

127 <213> ORGANISM: E. coli 

129 <400> SEQUENCE: 4 

130 ttcgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 
132 tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 
134 caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacga 180 
136 ggtgcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 
138 agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 300 
140 aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 
142 aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 
144 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 
146 tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg actaccctgg ttcagtgccg 540 
148 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 
150 accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 
152 gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 
154 ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 
156 gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaataa 840 

159 <210> SEQ ID NO: 5 

160 <211> LENGTH: 840 

161 <212> TYPE: DNA 

162 <213> ORGANISM: E. coli 

164 <400> SEQUENCE: 5 

165 ttcgcctgta aaaccgccaa tggtaccgct attcctattg gcggtggcag cgctaatgtt 60 
167 tatgtaaacc ttgcgcctgc cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 
169 caaatctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 
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RAW SEQUENCE LISTING DATE ■ 07/91/9001 

PATENT APPLICATION : US/09/900,575 Till, ll'ly 27 

Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRP3\07232001\I900575.raw 



^73 ITclttl^t ^ g9C9 ^ 9t Sttatctaat ttttccggga ccgtaaaata tagtggcagt 
^75 HZloT ttccgactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 
175 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggtgg ggtggcgatt 

17 gaStccaat t^T^ °^ Cttatt ttgcgacaga ccLcaacl 9 ^aaca'gcga" 
179 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactcmcaa C 
181 tgcgatgttt ctgctcatga tgtcaccgtt actctgccgg acLcSgg ttcaSgcca 
183 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacc? ctccqqcaca 
187 ITclalT 9 Cg9 r aaCtC ^attttcacc aataccgcg? cgttttcacc ajcgcagggc 
III ^ttgacgcg caacggtacg attattccag cgaataacac ggtatcgSa 

189 ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acqtaccqqa 
191 gggcagqtga ctacaacmaa t-rH-™ = ^ . gc ac y tac cgga 
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in sr^ ; ™ «™ ssss 

195 <211> LENGTH: 837 

196 <212> TYPE: DNA 

197 <213> ORGANISM: E. coli 

199 <400> SEQUENCE: 6 

200 ttcgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatatt fin 
202 tatgtaaaca ttgcgcccgc cgtgaatgtg gggcaaaacc iggLg^ggl tctttcgac^ 120 
206 ZTcalSt 9 t CCataaCga ttaccc ^- accattacag aSatgtSc actgcaacga ISO 
208 ITct Jltlt a ! ggC9gCgt Sttatctaat ttttccggga ccgtaaaata tagtggcagt 240 
lin agCtat ° Cat ttccgaccac cagtgaaacg ccgcgggttg tttataattc gagaacggat 300 
210 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggSgSK 350 
111 gatScS ? at ! aattgC ^gcttatt ttgcgacaga ccLcaac?! taacagcgS III 
214 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactgqcqqc 480 
218 aSSE Ct9 ^ Cgtga tgtcaccgtt actctgccgg acLccctgg ttcagtgccg 540 
218 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccqqcaca 600 
220 accgcagatg cgggcaactc gattttcacc aataccgcg? Cgttttcacc tgcacaggqc 660 
222 gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac gJSSSS 720 
226 lllT^ ggaCttCg9C ^taagtctg ggattaacgg caaattacgc Icg'accgga 

229 tgtgCaatCg attattgCcg ^ottttgt tta'tcaa^ 

230 <211> LENGTH: 837 

231 <212> TYPE: DNA 

232 <213> ORGANISM: E. coli 
234 <400> SEQUENCE: 7 



235 ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag cgccaatatt 

239 SStttt ttg T CC9t ffggcaaaacS SgJLSggJ tcttSgacg 

239 caaatctttt gccataacga ttatccggaa accattacag actatgtcac actacaacaa 

2 tSc g t CggCgt gttatCtaat ttttccggga' ccgtaaaata £££££ 

245 ttcctaccac cagcgaaacg ccgcgcgttg tttataattc gagaacggat 

245 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg gttgataatt 

2 ga a KSa g £ tlltT^ ttgcgacaga cca'acaacL' ?L^cgat 

^tttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcacrc 
251 tgcgatgttt ctgctcgtga tgtcaccgtt actctgccgg acLccg^gg 
253 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacc? ctccggcaca 
255 accgcagatg cgggcaactc gattttcacc aataccgcg? cgttttcacc tgcacagggc 
257 gtcggcgtac agttgacgcg caacggtacg attattccaa cgaataacac ggtatcg??a 
259 ggagcagtag ggacttcggc ggtaagtctg ggattaacgg cLattacgc acgSccgga 
261 gggcaggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt tScaa 
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HAW SEQUENCE LISTING DATE : 07/23/2001 

PATENT APPLICATION: OS/09/900,575 TIME: 13:37:27 

Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRP3\07232001\I900575.raw 

264 <210> SEQ ID NO: 8 

265 <211> LENGTH: 837 

266 <212> TYPE: DNA 

267 <213> ORGANISM: E. coli 
269 <400> SEQUENCE: 8 

"? tttgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60 
272 tatgtaaact tggcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 
274 caaacctttt gccataacga ttatccggaa accattacag actatgtcac actgcaacga 180 
276 ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 
278 agctatccat ttccgactac cagcgaaacg. ccgcgggttg tttataattc gagaacggat 300 
280 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggtgg ggtggcgatt 360 
282 aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 
284 gatttccagt ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc tactggcggc 480 
III t l'r 9 ^l tt ct 9ctcatga tgtcaccgtt actctgccgg actaccctgg ttcagtgcca 540 
288 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct ctccggcaca 600 
290 accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc agcgcagggc 660 
292 gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 
294 ggagcagtag ggacttcggc ggtgagtctg ggattaacgg caaattacgc acgtaccgga 780 
III 9 ^caggtga ctgcagggaa tgtgcaatcg attattggcg tgacttttgt ttatcaa 837 

299 <210> SEQ ID NO: 9 

300 <211> LENGTH: 837 

301 <212> TYPE: DNA 

302 <213> ORGANISM: E. coli 
304 <400> SEQUENCE: 9 

l°* "cgcctgta aaaccgccaa tggtaccgca atccctattg gcggtggcag cgccaatgtt 60 
309 cgtgaatgtg gggcaaaacc tggtcgtaga tctttcgacg 120 



caaatctttt gccataacga ttacccagaa accattacag actatgtcac actgcaacaa 



311 ggttcggctt atggcggcgt gttatctagt ttttccggga ccgtaaaata taatggcagt 240 
J13 agctatcctt tccctactac cagcgaaacg ccgcgggttg tttataattc gagaacggat 



315 aagccgtggc cggtggcgct ttatttgacg ccggtgagca gtgcgggggg agtggcgatt 360 

317 aaagctggct cattaattgc cgtgcttatt ttgcgacaga ccaacaacta taacagcgat 420 

„? ? a + U f a9t ttgtgtggaa tatttacgcc aataatgatg tggtggtgcc cactggcggc 480 

321 tgtgatgctt ctgctcgtga tgtcaccgtt actttgccgg actaccctgg ttcagtgccg 540 

323 attcctctta ccgtttattg tgcgaaaagc caaaacctgg ggtattacct atccggcaca 600 

325 accgcagatg cgggcaactc gattttcacc aataccgcgt cgttttcacc cgcgcagggc 660 

327 gtcggcgtac agttgacgcg caacggtacg attattccag cgaataacac ggtatcgtta 720 

329 ggagcagtag ggacttcggc ggtaagtctg ggattaacgg caaattacgc acgtaccgga 780 

334 SlS 9 SS a i5 t SS? g S aa t9t9CaatC * attattggcg tgacttttgt ttatcaa 

335 <211> LENGTH: 840 

336 <212> TYPE: DNA 

337 <213> ORGANISM: E. coli 

339 <400> SEQUENCE: 10 

340 ttcgcctgta aaaccgccaa tggcaccgct atccctattg gcggtggcag cgccaatgtt 60 
Vaa tat9 * aaacc ttgcgcccgc cgtgaatgtg gggcaaaacc tggtcgtgga tctttcgacg 120 
344 caaatctttt gccataacga ttacccggaa accattacag attatgtcac actgcaacga 180 
346 ggctcggctt atggcggcgt gttatctaat ttttccggga ccgtaaaata tagtggcagt 240 
348 agctatccat ttccgaccac cagtgaaacg ccgcgggttg tttataattc gagaacggat 300 
350 aagccgtggc cggtggcgct ttatttgacg cctgtgagca gtgcgggcgg ggtggtgatt 360 
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RAW SEQUENCE LISTING DATE • 

PATENT APPLICATION : US/09/900,575 TIME: V/.f/^l 



Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRF3\07232001\I900575 raw 

I^IIllpiilii 

370 <211> LENGTH: 837 

371 <212> TYPE: DNA 

372 <213> ORGANISM: E. coli 
374 <400> SEQUENCE: 11 

Hi ^ g ^ ctgta "wcgccaa tggtaccgca atccctattg gcggtggcag cgccaatatt 

£22£ SEC* 3252 SSS 2sfe 

SEES ESS* t= * 

385 aagccgtggc cggtggcgct ttattt™™ JvSS tttataattc gagaacggat 

387 aaagc?ggct cKtaaSgc cgtcctSt? Itllal^ ^tggcgatt 

I ESS 2S |= SHE SHsS 55 
1 a #5 55 =~~ S»= ~~ 

si S, :r»F ~ : «= s=S as™ 



405 <211> LENGTH: 840 

406 <212> TYPE: DNA 

407 <213> ORGANISM: E. coli 
409 <400> SEQUENCE: 12 



410 ttcgcctgta aaaccgccaa tggtaccgct atccctattg gcggtggcag ccxccaatati- 

414 otlltTtllt S C ° C9t T 9 ^^ ?gg?cg?ggl tcScgacg 

416 c\™ a ~ c C g \ a actatgtcac actgcaacga' 

418 agctatccat ttccta'ccac cagcgaaac^ c££2?£ S^S ta9tggCagt 
420 aagccgtggc cggtggcgct ttattta*™ l 9 tttataattc gagaacggat 
422 aaagciggct ca t c SSSSS ^tgagca ^gcgggcgg ggtggcgatt 

s ss ss IP s» ss» ss 

mmmmmm 
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440 <211> LENGTH: 837 

441 <212> TYPE: DNA 
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:12 M 
:12 M 
:2195 
:2453 
:2489 
:2531 
:2569 
:2654 
2732 
2796 
2808 
2820 
2832 
2844 
2856 
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:270 C 
:271 C 
M:220 
M:220 
M:220 
M:220 
M:220 
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M:220 
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M:220 
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VERIFICATION SUMMARY " 
PATENT S PP UCMI 0»: „ S/ „ V 900. 5 , 5 SS| J™™ 1 

Input Set : A:\FimH Peptides.ST25.txt 
Output Set: N:\CRF3\07232001\l900575.raw 
Current Application Number differs R.nipp^ n 

Current Filing Date differs LL^ p Current Application No 

, . y " aue aiders, Replaced Current Filing Date 



C. Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 
C: Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format, 
Keyword misspelled or invalid format 
Keyword misspelled or invalid format 
Keyword misspelled or invalid format' 
Keyword misspelled or invalid format,' 



C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 
C: 



<213> ORGANISM for SEQ ID#:46 
<213> ORGANISM for SEQ ID#:47 
<213> ORGANISM for SEQ ID#:48 
<213> ORGANISM for SEQ ID#:49 
<213> ORGANISM for SEQ ID#:50 
<213> ORGANISM for SEQ ID#:52 
<213> ORGANISM for SEQ ID#:55 
<213> ORGANISM for SEQ ID#:56 
<213> ORGANISM for SEQ ID#:57 
<213> ORGANISM for SEQ ID#;58 
<213> ORGANISM for SEQ ID#;59 
<213> ORGANISM for SEQ ID# : 60 
<213> ORGANISM for SEQ ID#:61 
<213> ORGANISM for SEQ ID#:62 
<213> ORGANISM for SEQ ID#-63 
<213> ORGANISM for SEQ ID#*64 
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